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Abstract 



In this paper, we derive the optimal transmitter/receiver beamforming vectors and relay weighting matrix for the 
^ ■ multiple-input multiple-output amplify-and-forward relay channel. The analysis is accomplished in two steps. In the 

first step, the direct link between the transmitter (Tx) and receiver (Rx) is ignored and we show that the transmitter and 
the relay should map their signals to the strongest right singular vectors of the Tx-relay and relay-Rx channels. Based 
on the distributions of these vectors for independent identically distributed (i.i.d.) Rayleigh channels, the Grassmannian 
codebooks are used for quantizing and sending back the channel information to the transmitter and the relay. The 
simulation results show that even a few number of bits can considerably increase the link reliability in terms of bit 
error rate. For the second step, the direct link is considered in the problem model and we derive the optimization 
problem that identifies the optimal Tx beamforming vector. For the i.i.d Rayleigh channels, we show that the solution 
to this problem is uniformly distributed on the unit sphere and we justify the appropriateness of the Grassmannian 
codebook (for determining the optimal beamforming vector), both analytically and by simulation. Finally, a modified 



1/^ , quantizing scheme is presented which introduces a negligible degradation in the system performance but significantly 



reduces the required number of feedback bits. 



in 
d 

^-H ' Index Terms 

' Multiple-input multiple-output systems, Amplify-and-forward relaying, Grassmannian criterion, Beamforming, 



Bit error rate. 

I. Introduction 

The multiple-input multiple-output (MIMO) technology provides a wireless system with a large number of degrees 
of freedom, which can be used for increasing the capacity and/or reliability of the wireless links. Relaying techniques, 
on the other hand, can extend the communication range and coverage, by supporting the shadowed users through 
the relay nodes, and reduce the transmission power required to reach the users far from the base station. These 
benefits make MIMO relaying techniques a powerful candidate for implementation in the next generation of wireless 
networks. 

Considering a system with a single data stream and perfect channel knowledge at the receiver, several methods 
can be used to achieve the benefits of the MIMO link. Maximum ratio transmission and receiving (MRT-MRC) 
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[1] is one of the simplest methods which can achieve full diversity order while providing considerable array gains 
compared to space- time codes [2]. This gain is obtained at the expense of the channel knowledge at the transmitter 
and therefore, the receiver needs to send the quantized channel information back to the transmitter. While a general 
purpose MMSE quantizer can be used to describe each channel matrix entry, it requires a large number of feedback 
bits and does not preserve the structure of the optimal beamforming vector [3]. A more efficient approach is to have 
a common beamforming- vector codebook with finite cardinality and send back the label of the best beamforming 
vector to transmitter. This codebook is designed offline and is known to the transmitter and the receiver. For the case 
of flat Rayleigh fading channel, the codebook design problem has been shown to be related to the Grassmannian 
line packing problem [4, 5, 6J. 

In this paper, we generalize the idea of MRT-MRC to a MIMO link with an amplify-and-forward relay station. 
The scenario, considered in this paper, comprises a transmitter (Tx), a receiver (Rx) and a relay which helps the 
transmitter to send its data to the receiver. A general information theoretic analysis of MIMO relay Unk has been 
presented in [7 J and [8J. Although an efficient signaling through the relay channel requires a full-duplex relay with 
specific processing capabilities (e.g. encoding/decoding), amplify-and-forward (AF) relays are stiU attractive due to 
their lower complexity. Moreover, the fuU-duplex assumption cannot be realized by the current technology, as the 
input and output signals need to be separated in time or frequency at the relay. For these reasons, this paper focuses 
on the half-duplex AF relay system. In such a system, the transmitter sends out its symbol in the first time slot and 
the relay and the receiver receive their signal. In the second time slot, the transmitter remains silent and the relay 
multiplies its received signal by a matrix (amplification) and sends the resulting signal to the receiver. The receiver 
decodes the transmitted symbol based on the signals received in two consecutive slots. 

The half-duplex MIMO AF scenario has been considered in [9] and [10], where the authors present different 
solutions for maximizing the instantaneous capacity with respect to the weighting (amplification) matrix of the 
relay. These papers assume no channel state information at the transmitter (CSIT) and consider uniform power 
allocation over the Tx antermas. The work in [11] considers the same problem with perfect CSIT and derives the 
optimal power allocation scheme for the transmitter and relay (without considering the Tx-Rx Unk). Our problem 
setup is different from these papers in two major aspects, listed below: 

• The objective of the aforementioned references is the maximization of the instantaneous capacity. Our problem, 
however, can be categorized as a beamforming problem, where we optimize the Tx/Rx beamforming vectors 
and the relay matrix to maximize the signal-to-noise ratio (SNR) of a single data stream at the Rx output. 

• The above papers assume either no channel information or complete channel information at the transmitter or 
the relay. Our work, however, focuses on a "limited feedback" system, where the receiver end of a Unk sends 
the properly quantized channel information back to the transmitter end. 

The analysis in this paper starts by first ignoring the direct link between transmitter and receiver, where we show 
that the transmitter and the relay should map their symbols to the strongest right singular vectors of the Tx-relay 
and relay-Rx channels. For Rayleigh fading channels, these vectors are uniformly distributed on the unit sphere and 
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therefore the Grassmannian criterion can be used separately for Tx-relay and relay-Rx codebook design. 

In the second part of the paper, we include the direct link in the system model. As expected, one needs to know 
both Tx-relay and Tx-Rx channel matrices to determine the optimal Tx beamforming vector for this case. We first 
assume that such a knowledge is available (for example at the relay), and we derive the optimization problem 
that characterizes the optimal Tx beamforming vector. Although this problem does not appear to have an analytic 
solution, we are able to show that for i.i.d. Rayleigh channels the solution to this problem is uniformly distributed 
on the unit sphere, based on which, the appropriateness of the Grassmannian quantizer can be shown analytically. 

In the next step, we relax the assumption of complete knowledge of the Tx-relay and Tx-Rx channels. Without 
this assumption, the Rx and relay should somehow exchange their information of the Tx-relay and Tx-Rx channels. 
We focus on a scheme, where the Rx quantizes the Tx-Rx channel matrix and sends it to the relay, which already 
knows the Tx-relay channel matrix. Assuming an ideal scalar quantizer for the singular values of the Tx-Rx channel 
matrix, we justify the use of the Grassmannian quantizer for quantizing the singular vectors. Finally, we present 
a modified quantizer, which only quantizes the strongest singular vector of the Tx-Rx channel and sends it to the 
relay. This quantizer requires fewer number of feedback bits and performs very close to the original quantizer. 

The remainder of this paper is organized as follows. In Section II, we present a brief introduction to Grassmannian 
line packing problem and its connection to the MIMO beamforming codebook design. Section III presents the 
problem setup and the solution for the MIMO relay channel without considering the direct link. In Section IV, the 
beamforming codebook design problem is solved with the direct link included in the system model. The simulation 
results are discussed in Section V. Finally, Section VI concludes the paper 

Notations: II and C denote the set of real and complex numbers. Bold upper case and lower case letters denote 
matrices and vectors. I shows the identity matrix. denotes the set of all unitary matrices in C™^™. | • | and 
II • II show the absolute value of a scalar and the Euclidean norm of a vector || • H^^, denotes the Frobenius norm of a 
matri)(Q. {-y" ™d (•)^ denote the transpose and Hermitian of a matrix. The notation = diag„jxn('/'ii 'i>2, • • ■ , 4>r) 
with r = min{m, n} shows a rectangular diagonal matrix ^ E (pnixn ^j^j^ j) — (f,. for i — 1, 2, • • • , r and 
€>(i, j) — for i ^ j. For an arbitrary matrix H e the singular value decomposition (SVD) of H is expressed 

as H = USV^, where U e U"^ and V e Z^/" include the left and right singular vectors as their columns, and 
S ~ diag„xn('''ij • • • where r = min{m,n} and (Ji>(T2> ■ ■ ■ >(Jr>0; if R — rank(H), the first R 

nonzero diagonal enterics of S are called the singular values of H. CAf{Q, S) represents a circularly symmetric 
complex Gaussian distribution with zero mean and covariance matrix S. Finally, E{ } denotes the expectation 
operation. 

II. MIMO Beamforming Codebook Design and Grassmannian Line Packing 

The connection between Grassmannian line packing problem and beamforming codebook design for a Rayleigh 
fading channel has been independently observed in [5] and [6] . Consider the MIMO channel in Fig. 1 . The transmitter 

'||A|||, = ^ I^JiiP = Trace(AA^) = J^j. ct^, where uj^'s are the singular values of the matrix A = [aij]. 
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z~CN(0,/) 



^^PH 



T — ► ^out 



Fig. 1. Single stream MIMO link with Tx and Rx beamforming. 



maps the symbol Xin to the antenna array using the beamforming vector s. The signal passes through the channel 
a/PH with complex Gaussian noise z ^ CN{0,V). The receiver recovers the symbol Xout using the receive 
beamforming vector r. The matrix \/PH e C'^™ models the flat fading channel and m and I are the number of 
the Tx and Rx antennas respectively. The entries of H are assumed to be independent and identically distributed 
according to CJ\f{0, 1). The coefficient P is referred to as the "Unk signal-to-noise ratio (SNR)". The output symbol 
can be expressed as 

Xout = y/Pr"'RsXin + r"z. 

Assuming a transmission power constraint of 1, satisfied by E{|a;i„|^} = 1 and ||s|| = 1, the received SNR 7 is: 

Plr^HsP 

^ = — iriT2 — ' 
IfII 

which should be maximized with respect to r and s. Maximization with respect to r is achieved by matching 
r = Hs, hence the optimal s should maximize 7 = P||Hs||^. It is easy to show that the optimal s is the right 
singular vector of H corresponding to its largest singular value. If we denote the largest singular value and the 
corresponding right singular of H by cti and vi, the optimal Tx beamforming vector is equal to s* = vi and the 
maximum SNR is 7* = Paf. 

For the Rayleigh fading channel matrix H, the singular vectors have been shown to be uniformly distributed on 
the unit sphere in C" (see [5], [12]). Therefore, a good quantizer of the optimal s, in a sense, should place its 
codebook vectors uniformly on the unit sphere. This requirement can be shown to be related to the criterion used 
in the Grassmannian Une packing problem, which we describe next. 

Consider the complex space C™ and let O be the unit sphere, O = {w e C^IHwH = 1}. Define the distance of 
two unit vectors to be sine of the angle between them: 

d(wi,W2) = - |wfw2P, (1) 

forwi,W2 e fi. For a codebook C = {wi, W2, •• • , wat} with distinct unit vectors, define 5(C) as the minimum 
distance of the codebook: 

6{C) = min d{wi,Wj). 

For a fixed dimension m and codebook size N, the Grassmannian Une packing problem [4] is that of finding a 
codebook C of size A'' with the largest minimum distance. Many researchers have studied the solution to this 
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problem for moderate values of m and N [13], [14]. However, there is no known standard way of finding these 
codebooks in general. 

For the problem setup in Fig. 1, consider a beamforming codebook C{N,S) of size N and minimum distance 
6. The receiver chooses the vector in this codebook that maximizes the SNR and sends the label of this vector 
back to the transmitter. Let 7 denote the resulting SNR: 7 = maxwec ^'||Hvirp. The authors in [5] have used the 
distribution of optimal beamforming vector s* to bound the average SNR loss as: 



where m is the space dimension (number of Tx antennas). The upper bound in (2) is a decreasing function of S, for 
any m > 1. Therefore, to minimize the upper bound of the SNR loss, we should maximize the minimum distance 
of the codebook. This is the same criterion used in the definition of the Grassmannian line packing problem and 
establishes the connection between the beamforming codebook design problem and the Grassmannian line packing. 

Before concluding this section, we mention that the codebook design problem for the beamforming system in 
Fig. 1 has been generalized by [15] to the multiplexing systems, where the Tx transmits multiple substreams to the 
Rx. In such systems, the transmitter and receiver share a codebook of preceding matrices and the receiver sends 
back the label of the matrix that maximizes a certain performance criterion (e.g. the minimum substream SNR). In 
this paper, we take the first step in designing the limited feedback systems for beamforming over MIMO AF relay 
channels. The generalization of the relay problem to the case of multiple data streams is considered as the future 
work. 



In this section, we consider the MIMO amphfy-and-forward (AF) relay channel without the direct link and 
derive the optimal transmitter/receiver beanoforming vectors and relay weighting matrix in Subsection III.A. Next, 
we present the quantization scheme in Subsection III.B. It should be noted that if the relay performs decode-and- 
forward, the MIMO relay channel reduces to two MIMO links in series, therefore the optimal structure and the 
quantization scheme in Section I can be applied to each of the links separately. However, the derivation of the 
optimal unquantized scheme and designing the corresponding quantization scheme is not trivial when the relay 
performs amplify-and-forward. 

A. Optimal Unquantized Scheme 

Consider the MIMO amplify-and-forward relay system in Fig. 2a, where the direct link between transmitter and 
receiver is ignored. The transmitter, the relay and the receiver are equipped with m, n and I antermas, respectively. 
The matrices a/^Hi e C"^™ and \/^H2 e C'^" model the flat fading channels of the Tx-relay and relay-Rx 
links, respectively. The coefficients Pi and P2 are referred to as Tx-relay and relay-Rx "link SNRs". The transmitter 
uses the vector s for beamforming. The relay multipUes its noisy received signal by the matrix W € C'^' and 



E{7*}-E{7} < 




(2) 



III. MIMO Amplify and Forward Relay Channel without the Direct Link 
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Fig. 2. (a) MIMO amplify-and-forward relay channel model without the direct link, (b) The model after the change of variables x 

^2 



VTTV^His and y = v^U-f^Hf r. 



sends it to the receiver. The receiver recovers its symbol using the receive beamforming (combining) vector r. We 
assume power constraints equal to 1 at the transmitter and the relay outputs. 

The problem is to find the optimal s, W and r, to maximize the SNR at the receiver output subject to power 
constraints at the Tx and at the relay. For this problem setup, a reasonable solution is "matching", as described below. 
The transmitter should map its symbol to the strongest right singular vector of Hi (as described in Section 11). The 
relay should absorb maximum signal power by matching to the effective channel^ HiS, scale the resulting (noisy) 
signal to meet its power constraint and transmit it through the strongest right singular vector of H2. Finally, the 
receiver should match to the relay-Rx link by using the strongest left singular vector of H2 as the Rx beamformer. 
This matching solution is depicted in Fig. 3a, in which 

Hi = A*B^, 

H2 = F*G^, (3) 

are the SVD decompositions of Hi and H2, and 

A=[ai|a2|---|a„] eZ^", F =[fi|f2| • • • |f,] e Z^', 

B=[bi|b2| • • • |b,„] e U^, G=[gi|g2| • • • |g„] e 

*=diag„xm{0i, 02, • ■ ■, 0ri},*=diag,x„{'0i, "02, ■ • ■■.i'r^}-. 

where r\ — min{n, m}, r-i — min{/, n\. Although matching seems to be the natural solution to this problem, 
showing that the optimal W is a rank one matrix and that matching is optimal is not trivial. This is mainly due 

^This matching vector is parallel to the strongest left singular vector of Hi. 
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to the noise amplification at the relay, which generates colored noise at the receiver input. In the remainder of this 
section, we present a proof for the optimahty of this scheme. 
The relay and receiver output signals in Fig. 2a are: 



Xout =V^r^H2WHisXi„ + v^r^HsWzi + r^za, 

Xre(ai/ = -\/^WHiSa;i„ + Wzi, 

where zi ~ CAf{0, 1) and Z2 ~ CJ\f{0, 1) are the complex Gaussian noise vectors at the relay and Rx input. The 
transmitter power constraint is satisfied by letting E{|a;j„|^} = 1 and ||s|| = 1. Also, the relay power constraint, 
which limits the power of the amplified signal and noise, can be expressed as: 

E{Ke;a,ir} = Pi IIWHisll V ||W||^ = 1. 

Finally, the "received SNR" can be written as: 



P1P2 Ir^HaWHisj^ 
^~ PallW^^HfrllVllrlP' 



where we can assume ||r|| = 1, without loss of generaUty. The optimization problem can be summarized as: 

P1P2 |r^H2WHis|' 

max ■ 5 — (4) 

P2||W^Hfr|| +1 

s.t. 

Pi||WHis||' + ||W||2 =1 
WeC'>^', see™, reC". 
Theorem 1: The optimal values of Tx/Rx beamforming vectors and relay weighting matrix for the SNR maxi- 
mization problem in (4) are given by: 

s* = bi, r* = fi, W* = agiai^, 

where we have used the SVD equations in (3), and a = (l + Pi4>i) ^ ■ Note that the optimal weighting matrix 
W* is a rank one matrix. 

Proof: The optimization is accomplished in two steps. In the first step, we fix s and r and maximize the 
objective with respect to W. In the second step, optimal s and r are derived after substituting the optimal W in 
the SNR expression. 

Step 1) Maximization with respect to W.- 

Define hi = y^His and h2 = y^H^r. By fixing s and r, hi and h2 are also fixed. Let ci = ||hi|| and 

C2 = ||h2|!. 

Consider W = USV^ as the SVD of W, where U,V e and S = diag;x({o-i, CT2, • • • ,ai}. The 
calculations provided below perform the optimization with respect to U, V and S. 
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Define x — V^hi and y = U^h2, which impose the constraints ||x|| — ||hi|| = c\ and ||y|| = ||h2|| — C2 on 
X = [xi, 0:2, • • • , X]^ , y — [yi, 2/2, • • • 1 ViV ^ The maximization with respect to U, V and S, i.e. (4), can 
now be rephrased as a maximization with respect to x, y and S; 



max 



s.t. 



[y^'Sxl 
|Syf + 



(5) 



IWII 



l|x|| -ci 

l|y|| = C2 

Xi, Ut eG, CTi > 0, i = 1,2, • • 
where the power constraint of the relay is computed as follows: 

Fi||WHis||V||W||2 ^ IjUSV^hif „ „^ 

The problem in (5) is exactly the SNR maximization problem for the (single-hop) MIMO link depicted in Fig. 2b, 
where x and y are the transmit and receive beamformers and S is the channel. Note that the only constraint on 
the receiver beamformer y is on its Euclidean norm, therefore, the optimal y is the minimum mean square error 
(MMSE) filteJl Hence, the optimal y and the corresponding SNR are: 



y=c 



1 



Ex 



(6) 



7-x«S fs^ + 



Sx, (7) 

where + is the covariance matrix of the equivalent noise and Sx is the equivalent channel from the input 
symbol to the receiver input. The scalar c is chosen to satisfy the constraint ||y|| — C2- 

For the next step, we find an upper bound for the SNR expression in (7) by considering the constraints on x/s 
and (Ti's, and we present the optimal values of x and S that achieve this upper bound. Considering (7), we get to 
the following maximization problem. 

I 2 

max V|a:,p ' (8) 
s.t. 

[ l|x!|=ci 

e c, > 0, i = 1,2, - • • 



^For a general input-output relation Xout = y^(ha;i„ +z), the optimal (SNR maximizing) receiver beamforming vector is the MMSE filter 
y = cK^^h for K being the covariance matrix of z and any scalar c. The resulting (maximum) SNR is 7 = h^K^^h. 
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Define Pi = = "pTIp- Clearly, < Pi < I and J2i=i f^i — 1- Now, consider the objective function in (8): 



where C Ei '''i^kil^- Th^ inequality in (9) is a result of the concavity of the function for t > 0. 

Now, from the second constraint of the problem (8), we have: 

i i i i i 

Therefore, af > and by applying the same constraint, we can bound (: 



(10) 



c = E-»W = i-E-^'^TT^ 



(11) 



Finally, by combining (10) and (11), and noting that (10) is increasing in we have the following upper bound 
for the SNR: 

^2 2 

By reconsidering the problem in (5), it is easy to check that the following choices of x, S and y satisfy the 
constraints and achieve the upper bound in (12). 

x=[ci,0,---,0f , y=[c2,0,---,0]^, S=diag,xi{(7A---,0}, (13) 

where cr = (l + cf ) ^ . Recalling the definitions of x, y, ci and C2, the optimal values in (13) can be achieved by: 

V=[hi|vi| • • • |v,_i], U=[h2|ui| • • • |u(_i], 

S=diag,x,{a,0,---,0} (14) 

where hi = ||^7||> h2 = and a = {1 + ||hi||^)~5. Here {vi,--- ,v;_i} and {ui,--- ,ui-i} are arbitrary 
orthonormal basis for the null-spaces of the hi and h2 respectively. 

To summarize, having s and r fixed, the optimal structure of W = USV^ and the corresponding SNR value 
are: 

W = ahzhf (15) 

^ i+iihif +iih2ir ^ ^ 

where a = (l-|-||hi||^)~5, hi = v^His, and h2 = x/^H^'^r. This result finalizes the maximization with respect 
to W. 

Step 2) Maximization with respect to s and v: 
From (16) we see that 7 is increasing both in ||hi|| and ||h2||. Therefore, for maximizing the SNR, we should 
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Zi~CN(p,I) 



ai 



Z2~CN[0, 1) 



(a) 

Zi~CJV(0,/) Z2~CiV(0,/) 



a 



Relay 



g 



(b) 

Fig. 3. (a) Optimal unquantized scheme for MIMO AF without the direct link, where Hi = A*B^ and H2 = F*G^. (b) Quantized 
scheme for MIMO AF without the direct Unk. 



maximize ||hi|| and ||h2||, subject to ||s|| = ||r|| = 1. Considering the definitions of hi and h2, the optimal value 
is achieved by letting s be the strongest right singular vector of Hi and r be the strongest left singular vector of 
H2. This concludes the maximization in step 2. ■ 
Substituting the optimal solution, found in Theorem 1, in equation (16) reveals the optimal SNR: 

1+7^ + 72 

where 

71*= maxPi||His||2 = Pi02^ 

l|s||=l 

72*= maxP2||H2s||2 = P2V'?- (18) 

l|s||=l 

The optimal solution in Theorem 1 verifies the optimality of the scheme in Fig. 3a, where the Tx and relay use 
the strongest right singular vectors of the Tx-relay and relay-Rx channel matrices for beamforming. Assuming that 
the relay knows Hi and the receiver knows H2, the optimal structure can be achieved if: 

• The relay informs the transmitter of bi, the strongest right singular vector of Hi. 

• The receiver informs the relay of gi, the strongest right singular vector of H2. 

Considering this, we continue the problem in Subsection III.B by characterizing the codebooks that should be used 
for quantizing the optimal beamforming vectors. 

B. Quantization Scheme 

Fig. 3b presents a scheme which mimics the optimal scheme (Fig. 3a), with the difference that the Tx and relay 
beamforming vectors belong to certain codebooks with finite cardinality. 

In Fig. 3b, the Tx beamforming vector b should belong to a codebook Ci{Ni,6i) shared between the Tx and 
relay, and similarly, the relay beamforming vector g should belong to a possibly different codebook C2(A^2, S2), 



February 2, 2008 



DRAFT 



11 



which is shared between the relay and Rx. The relay and Rx use a and f for receive beamforming, respectively. 
All transmit/receive vectors a, b, f and g are assumed to be of unit norm, and a = {1 + Pija^Hibp)^^/^ in 
order to satisfy the relay power constrainO- The received SNR of the quantized scheme can be easily shown to be 



equal to; 



where 71 = Pi 



71 72 

7 = T— — , (19) 

1 + 71+72 



a^Hib and 72 - P2 FHag 



2 



are the received SNRs of the Tx-relay and relay-Rx channels. 
As 7 is increasing both in 71 and 72, we should maximize these quantities to maximize the SNR of the quantized 
scheme. This is accomplished, as in Section II, by letting a and f to be matched to Hib and H2g, and, choosing 
b and g based on: b — argmaxweCi AHHiWp and g ~ argmaxweC2 -f2||H2w||^. The corresponding received 
SNR values are 

71 = max PillHiwf , 72 = max P2||H2wf , (20) 

wGCi WGC2 

and the maximum received SNR of the quantized scheme 7 can be computed by substituting these quantities in 
(19): 

7172 

7 = -I , - , - ■ (21) 

1 + 71+72 

In Appendix II. A, we use the distributions of the optimal beamforming vectors bi and gi for Rayleigh channels 
to compute the following upper bound for the total loss in SNR caused by quantization. 

2(r?j-l) 



E{7*} - E{7} < 2mnPi 1-Ni 



^^,2(„-l) 




+ 2nlP2 \^1-N2 i^-fj ^1-^ ] I , (22) 

This upper bound is decreasing in Si and 62 for any m > 1 and n > 1. Therefore, to minimize this upper bound, 
we should maximize the minimum distances Si and 62- This is exactly the criterion used in Grassmannian codebook 
design and proves the efficiency of these codebooks for quantizing the optimal beamforming vectors. In Section 
V, we present simulation results which compare the performance of the Grassmannian quantizers with the optimal 
(unquantized) scheme and other possible quantization schemes. 

IV. MIMO Amplify and Forward Relay Channel with the Direct Link 

In this section the direct link is included in the system model (Fig. 4). The optimal unquantized scheme is derived 
in Subsection IV.A and the quantization scheme is presented in IV. B. Finally, in IV.C we introduce a modified 
quantized scheme, which significantly reduces the number of feedback bits with a negligible degradation in the 
system performance. 

^The Tx power constraint is automatically satisfied by assuming |jb|| = 1. 
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constraint 



Tx power z^-CNiOJ) 
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Z2~CiV(0,/) 



w 



z„~CiV(0,/) 



Fig. 4. Half-duplex MIMO AF relay channel model with direct link between the transmitter and the receiver. 



A. Optimal Unquantized Scheme 

Consider the half-duplex MIMO-relay link in Fig. 4. At the first time slot, the relay is silent and the Rx receives 
its symbol. At the second time slot, the Tx is silent and the relay amplifies and forwards its signal (received in the 
first time slot) to Rx. The receiver has access to two received symbols j/o and yi separated in time: 

2/0 = \/^r^Hosa; + r^zo 



yi = /PT^rf HaWHisa; + rf (^/^HaWzi + Z2 , 

The receiver computes the linear MMSE combination of j/g ™d yi to compute the output symbol Xo.^t'- 

Xout = aoyo + aiyi. 

By proper choice of ao and ai the output SNR ijf|: 

7 = 7o+7r., (23) 

where 70 and 7^ are the received SNR values of the direct link and the Tx-relay-Rx link. Therefore, the total SNR 
is maximized if the received SNRs of the direct and relay links are maximized. The only common parameter in 
maximizing these two quantities is the Tx beamforming vector s. 

By fixing s and following the same steps in Sections II and III, the optimal values of other parameters can be 
easily derived, as showed in Fig. 5a. In the first time slot, the relay and the Rx should respectively match to His 
and HqS at their inputs. In the second time slot, the relay maps its normalizecj^ symbol to gi the strongest right 
singular vector of H2 and the receiver uses fi, the strongest left singular vector of H2, for receive beamforming. 
The corresponding received SNRs of the direct link and relay link are: 

70 = Po||Hos||2 

Ir = T-^, (24) 

1 + 71+72 

^This is a result of the MMSE combination, or MRC after scaling the noise levels of the symbols yo and y\. 
*To meet the relay power constraint. 
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Zi~CW(0,/) a 



Z2~CN[0,I) 



.9\ 



iPnH, 



(a) 



IPaH, 



(b) 













fl 









• ^out 



Z|,~CiV(0,/) 
Z2~CiV(0,/) 



«0 



i yo 

z„~CW(0,/) 



Fig. 5. (a) Optimal unquantized scheme for MIMO AF with the direct channel. In the first time slot, the relay and the Rx match to His* 
and Hos*, respectively, (b) Quantized scheme for MIMO AF with direct link. In the first time slot, the relay and the Rx match to His and 
Hqs. respectively. In the second time slot, the relay matches to H2g. When s is replaced by s* in (31), we will refer to this system as the 
"properly quantized scheme". 



where 71 = Pi||His|p and is the maximum received SNR of the the relay-Rx Hnk: 72 = P2||H2gi||^ ~ ^2V'i- 
By combining (23) and (24) the total received SNR is: 

Pi||His|p7l , p ||„ ,||2 
and therefore, the optimal s can be expressed as: 

where A — and fi — The corresponding total received SNR is: 

where 7J = Po||Hos^||2 and 7^ = Pi||His*||2. 

The objective function of the problem in (25) has multiple local maximum points and moreover, the global 
maximum point is not unique^. This problem does not appear to have an analytic solution and as a result we use 
a numerical approach to perform this optimization, which will be described in Section V. 

'if s is a global maximum point, so is e^^s, for any 6 g R. 
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Despite the fact that we do not have a closed form expression for the solution of problem (25), we are still 
able to identify the distribution of the solution for Rayleigh fading channels. The main result of this section is the 
following theorem. 

Theorem 2: For independent Rayleigh channel matrices Hq and Hi, the optimal Tx beamforming vector s* that 
maximizes the total received SNR (or equivalently the objective function in (25)) is uniformly distributed on the 
unit sphere in C", where m is the number of Tx anteimas. 

Proof: See Appendix I. ■ 

Note that if we had a single channel from the transmitter to the receiver, the optimal Tx beamforming vector 
would be uniformly distributed on the unit sphere in C" (see Section 11). Interestingly, Theorem 2 states that the 
optimal Tx beamforming vector is still uniformly distributed on the unit sphere, when there are two independent 
parallel channels from the transmitter to the receiver. This is basically due to the independence of Hq and Hi, and 
the specific properties of the Rayleigh channel matrices. 

The result in Theorem 2 is used in Appendix n.B to derive an SNR loss upper bound, similar to (2) and (22), 
which justifies use of the Grassmannian codebook for quantizing the optimal Tx beamforming vector s*. 

B. Quantization Scheme 

Having identified the optimal scheme, we continue by considering the quantization scheme in Fig. 5b. In the 
first time slot, the Tx uses s for beamforming, and relay and Rx match their receive vectors to His and Hqs. In 
the second time slot, the relay scales its symbol and uses g for beamforming and Rx matches to H2g. The Tx-Rx, 
Tx-relay, relay-Rx and total received SNR values is given by: 



We need to maximize (27) with respect to the Tx and relay beamforming vectors s and g, which belong to certain 
codebooks with finite cardinalities. As in Section III, we assume that the codebooks Qi\{N\^5\) and Q'liNi-i^'i) 
are shared between Tx-relay and relay-Rx, respectively. Clearly, g should be chosen to maximize 72: 



The corresponding relay-Rx received SNR is: 72 = maxwec2 ^2||H2w|p. 

For choosing the proper s, we need to know both Ho and Hi. We continue the problem here by assuming that 
the relay knows Hq in addition to its channel Hi. This assumption will be relaxed in IV.B.2. 

1) Complete Knowledge o/Ho at the Relay: If the relay knows both Hq and Hi, then based on (27) the best 
vector s should be chosen as follows: 



7o = fo||Hosf, 7i = Pi||Hisf, 72 = P2||H2gf 




(27) 



g = arg max P2IIH2WI 



weC2 



(28) 





where A = ^ and /i = ^. The maximum total received SNR of the quantized scheme 7 can be computed by 
substituting (28) and (29) in (27). 
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In Appendix II.B, we use the distribution of s*, given in Theorem 2, to prove the following bound on the SNR 
loss caused by quantization. 

E{7*} - E{7} 

/ .gs^ 2(m-l) 

< 2 (mZPo + mnPi) 1-Ni ( -j 

+ 27^/F2 |^l-iV2 (^y ) ' 1 I- ^^^^ 

This upper bound is decreasing in Si = S{Ci) and 62 = S{C2) for any m,n > 1 and justifies the use of 
Grassmannian codebooks, Ci and C2, for quantizing the optimal Tx and relay beamforming vectors, and gi. 

2) Partial Knowledge 0/ Ho at the Relay: As mentioned earlier, the computations in IV.B.l are based on the 
assumption that the relay knows Hq completely. In reality, however, the Rx needs to quantize Hq and send it 
to the relay. We should note that, the only way that Hq contributes to the problem in (28) is through the term 
||How||^, which can be expanded as follows: ||How||^ = '^i^l^f^'^P' where i^i's and e^'s are the singular 

values and right singular vectors of Hq and Rq — rank(Ho). Therefore, the relay only needs to know the singular 
values and the right singular vectors of the direct link channel. Since our focus in this paper is on the vector 
quantization feedback schemes, we assume that the relay knows the singular values completely but has only access 
to the quantized versions of the singular vectors. 

For quantizing the singular vectors, the Rx and the relay share a codebook Co(A^o, <5o), which is possibly different 
from C2 (used for determining g). We assume that the Rx quantizes each vector e,; to a vector G Co that is 
closest to e;. 

— arg min (i(w,ei). 

wECq 

Having i/fs and e/s at the relay, the problem of finding the Tx beamforming vector s* can be reformulated aj^ 

i* = arg max J^^"!"' ^ + fij^ |ef wj^, (31) 
weCi ||Hiw||^ + A ^ 

where A = ^-p^, = and 72 = maxweC2 ^2 ||H2w|p. The total received SNR 7"^ can be computed by 
substituting (28) and (31) in (27). Finally, the loss in the received SNR can be bounded as follows (see Appendix 

*Here, we have used the notation s* to distinguish this vector form the vector s in (28), where we were assuming that the relay knows Hq 
completely. 
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n.c). 

E{7*}-E{r} 

/ /X \ 2(m-l) / c 

(I)--" (.4)) 

+ 4mZPo(l-A^o(|)''""'(l-|)). (32) 

The upper bound in (32) is decreasing in 6o — d{Co) for any m > 1. This justifies use of the Grassmannian 
codebook to quantize the singular vectors of Ho, since it has the maximum minimum distance Sq. The same 
conclusion holds for Ci and C2, since the upper bound in (32) is decreasing in 61 and 62 for any m,n> 1. 

To summarize the results, all three codebooks Co, Ci and C2 need to be Grassmannian codebooks to minimize 
the upper bound of the loss in the total received SNR. We refer to the scheme, determined by (31), as the "properly 
quantized scheme". In the following we outiine the steps in determining ihe beamforming vectors of the "properly 
quantized scheme" (Fig. 5b). 

1) The Rx uses a Grassmannian codebook C2, shared between the Rx and the relay, to quantize g, the strongest 
right singular vector of the relay-Rx channel H2. The label of the quantized vector is sent to the relay. The 
relay uses this vector for its beamforming in the second time slot. The Rx also sends the SNR value 72 to 
the relay. This will be used in step 3. 

2) The Rx quantizes the right singular vectors of the Tx-Rx channel using a Grassmannian codebook Cq, which 
is shared between the Rx and the relay. The labels of the quantized vectors and the singular values fj's are 
sent to the relay. 

3) The relay forms the objective function in (31) and maximizes it over the Grassmannian codebook Ci, which 
is shared between the Tx and the relay. The relay sends the label of the maximizing vector to the Tx. The 
transmitter uses this vector for its beamforming in the first time slot. 

Before concluding Section IV, we introduce a modified scheme which performs very close to the "properly 
quantized scheme" but requires fewer number of feedback bits. 

C. Modified Quantized Scheme 

Consider the problem of determining the Tx beamforming vector for the quantized scheme in Fig. 5b (see equation 
(31)). There are two links between the transmitter and the receiver; the direct (Tx-Rx) link and the Tx-relay-Rx 
Unk, which we refer to as the relay Unk. If the direct Unk is much weaker than the relay link and can be ignored 
safely, our problem reduces to the problem in Section III and the relay does not need to know anything about the 
direct hnk channel Hq. On the other hand, if the relay link is very weak and can be ignored, the only thing that we 
need to know about Hq is its strongest right singular vector. Therefore, in both of these extreme cases we do not 
need to have any knowledge of Hq other than its strongest right singular vector. Based on this intuition, we propose 



) 
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a new scheme, referred to as the "modified quantized scheme", in which the Rx only quantizes the strongest right 
singular vector of Hq and sends the corresponding label (and the largest singular value i^i) to the relay. The relay 
then determines the proper Tx beamforming vector by forming the following problem. 

Smodified = arg max \ ~- + /it/i |ef wp, (33) 

weci IIHiwII-^ + A 

where A and ji have the same definitions as in (31). 

The "modified quantized scheme" requires much fewer number of bits, since it only quantizes one singular vector 

(see step 2 for the properly quantized scheme). Our simulation results show that the "modified quantized scheme" 

performs very close to the "properly quantized scheme", as we will see in Section V. 

V. Simulation Results 

In this section, we provide simulation results for the scenarios discussed in the Sections III and IV. The results 
are divided into two subsections. In V.A the direct link between the transmitter and the receiver is ignored, as in 
Section III (see Fig. 2). In V.B, the simulation results are presented for the case where the direct link is present in 
the model (Fig. 4). 

The general setup for the simulations is as follows. The input symbols belong to a BPSK constellation with 
unit power. The entries of the channel matrices, which model the i.i.d Rayleigh fading channels, are generated 
independently according to CA/^(0, 1). To model quasi-static fading channels, the simulation time is divided to 
20, 000 coherence intervals, each consisting of 200 symbols. The channels are assumed to be constant over each 
coherence interval and to be independent from one interval to the other. The simulation results compare different 
(quantized and unquantized) schemes from the bit-error-rate (BER) point of view. 

A. MIMO AF Relay Channel without the Direct Link 

In this section, the direct link is not considered in the simulation model (Fig. 2). All of the stations (Tx, relay 
and Rx) are assumed to have two antennas (m = n = Z = 2). The relay-Rx link SNR is fixed at P2 = 8dB and the 
BER values have been recorded for different values of the Tx-relay link SNR Pi. For the quantization purposes, 
the Tx and relay share a codebook Ci of size iVi. Similarly, the relay and Rx share a codebook C2 of size 7V2. 

Fig. 6 compares the performance of the "optimal unquantized scheme" (Fig. 3a) with the performance of the 
Grassmannian codebooks Ci and C2 of sizes A^i = A^2 = 4 or 8. The Grassmannian codebooks are adopted from 
[5]. The total number of the feedback bits used by the Grassmannian quantizer is log2 A^i + log2 N2 which equals 
4 or 6 bits for A^i = iV2 = 4 or 8. As Fig. 6 shows, we can get very close to the optimal scheme with only a few 
number of bits per each coherence interval. We have also simulated the performance of the Alamouti code, to show 
the high power gain that can be achieved by using the Grassmannian codebooks compared to space-time codej^ 

'in the implementation of tlie Alamouti code, we have assumed that the relay does not perform any decoding on its received symbols, 
to comply with the amplify-and-forward assumption. The relay decomposes the symbols coded by the Almouti code, and performs another 
Alamouti coding on the decomposed symbols and sends the scaled symbols through the relay-Rx channel. 
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Fig. 6. Comparison of the performance of Grassmannian quantization scheme witli the optimal (unquantized) scheme and the Alamouti 
space-time coding. The relay-Rx link SNR is fixed at 8dB. 

In Fig. 7 we compare the performance of the Grassmannian quantizers with other quantization schemes. For the 
MMSE quantization scheme, the Rx and relay quantize every entry of the channel matrices H2 and Hi according 
to the MMSE criterion and send the quantized channel matrices to the relay and Tx, respectively. The Tx and the 
relay perform singular value decomposition on these quantized matrices and use the corresponding strongest right 
singular vectors for beamforming. We have assumed that the quantizer uses two bits to quantize each channel entry, 
i.e., one bit for each of the real and imaginary parts. For m = n = I = 2 this results in 2{mn + nl) = 16 bits 
which should be compared to the small number of feedback bits in the Grassmannian scheme. 

Fig. 7 also compares the Grassmannian quantizer with the random quantization scheme. The random quantizer 
uses a set of randomly selected vectors on the unit sphere as its quantization codebook. The performance of the 
random scheme has been averaged over ten such codebooks. As Fig. 7 shows, the Grassmannian scheme shows 
considerable gain as compared with the random quantizer However, this gain decreases as the codebook sizes are 
increased from 4 to 8. The main advantage of the random codebooks is that they are easy to generate as compared 
with the Grassmannian codebooks. 

B. MIMO AF Relay Channel with the Direct Link 

In this section, we simulate the system model in Section IV, where the direct link has been included in the 
analysis. All the stations are equipped with three antennas {m = n = I = i). 

Fig. 8 compares the "optimal unquantized scheme" (Fig. 5a) with some other unquantized schemes. The Tx- 
relay and relay-Rx link SNR's are fixed at -Pi = P2 = 2dB and the BER values are recorded for different values 
of the direct link SNR Pq- For the optimal scheme, we use the gradient descent method for determining the Tx 
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Fig. 7. Comparison of the performance of the Grassmannian quantizer with the MMSE and random quantizers. The relay-Rx link SNR is 
fixed at 8dB. 



beamforming vector from (25). The constraint ||s|| = 1 is eliminated by the change of variable s = jj^. 

The curve marked by V shows the performance of the scheme that ignores the direct link in determining the Tx 
beamforming vector For this scheme, the Tx beamforming vector is always set to the strongest right singular vector 
of the Tx-relay channel. As expected, the performance of this scheme diverges from the optimal scheme as the 
direct link gets stronger The next curve, marked by □, shows the performance of the scheme which considers only 
the stronger link for determining the Tx beamforming vector In this scheme, the Tx switches between the strongest 
right singular vectors of the Tx-relay and Tx-Rx links depending on their received SNR values. The last scheme, 
called the "modified unquantized scheme", has the same structure as the "optimal unquantized scheme" with the 
difference that the relay only considers the strongest singular value and singular vector of Hq in formulating the 
problem of determining the Tx beamforming vector This problem is exactly the same as the problem (25), used by 
the optimal scheme, except that ||Hos|p is replaced by j/^|e^sp, where i^i and ei are the strongest singular value 
and right singular vector of Hq. In Appendix III, we show that the average SNR loss of this scheme with respect 
to the optimal scheme is at most 1.24dB for the system with m = n = I = 3 antennas. As the simulation results 
in Fig. 8 verify, the modified unquantized scheme performs very close to the optimal scheme. This unquantized 
scheme is the basis for a quantization scheme that has been referred to as the "modified quantized scheme" in 
Section IV (see (33)). 

In the next two simulation setups, we study the performance of the quantized schemes. As discussed in Section 
IV, the scheme consists of three codebooks Co, Ci and C2 of sizes Nq, Ni and N2- The codebook Co is used for 
quantization of the direct link channel Ho. The codebook C2 is used to determine the relay beamforming vector in 
the second time slot. The codebook Ci determines the Tx beamforming vector in the first time slot. Fig. 9 shows 
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Grassmannian, 4 vectors 
Grassmannian, 8 vectors 
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Fig. 8. Comparison of the optimal unquantized scheme with other unquantized schemes. The Tx-relay and relay-Rx linlc SNRs are fixed at 
Pi = P2 = 2dB. 

the performance of the "properly quantized scheme" with Grassmannian codebooks of sizes Ni = N2 = N3 = 8, 16 
(see the three steps for properly quantized scheme in Section IV). The Tx-relay and relay-Rx link SNRs are fixed 
at Pi = P2 = 2dB and the BER values have been recorded for different values of the direct link SNR Pq. The 
Grassmannian codebooks are adopted from [14]. 

The figure also shows the performance of the Grassmannian codebooks with "modified quantized scheme" (see 
(33)). This scheme shows a negligible performance degradation with respect to the "properly quantized scheme", 
but requires fewer number of feedback bits. As an example, we compare the total number of bits required by 
the properly quantized and the modified quantized scheme. For quantization of the scalar values, we assume a 
hypothetical quantizer which requires b bits for quantizing a scalar quantity. Recall the three steps of the properly 
quantized scheme in Section IV. For step one, we need log2(A^2) bits for quantizing g and b bits for quantizing 72. 
In step two, we need Rq (log2 {No) + b) for the "properly quantized scheme" and log2 ( A'q ) + 5 bits for the "modified 
quantized scheme", where Rq = rank(Ho). Finally, for the third step, we need log2(Afi) bits for quantizing the Tx 
beamforming vector. Therefore, we need a total of (1 + i?o)f' + log2(^o^''-^^i^2) bits for the "properly quantized 
scheme" and 26 -I- log2(A''o-/ViiV2) bits for the "modified quantized scheme". Table I compares these values for 
N = Nq — Ni = N2 = 8, 16, and m = n — I — 3. Here we have assumed a full rank channel matrix Hp. 

Fig. 9 also shows the performance of the MMSE quantizer. This quantizer requires 2{mn + ml + In) bits for 
quantizing the channel matrices and b bits for quantizing 72. 

Fig. 10 compares the performance of the same schemes of Fig. 9 in a different scenario. For this figure, the 
direct link and relay-Rx link SNR are fixed at Pq = — 4dB and P2 = 2dB. The BER values have been recorded for 
different values of the Tx-relay link SNR Pi. Once again, we see that the performance of the "modified quantized 
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TABLE I 

Comparison of the Number of the Feedback Bits for Different Quantization Schemes 



Scheme 


Number of feedback bits 


N = 8 


Af=16 


Properly quantized 


15 + 46 


20 + 46 


Modified quantized 


9 + 26 


12 + 26 


MMSE 


54 + 6 




-10 -8 -6 -4-2 2 

Direct link SNR (P^) dB 



Fig. 9. Comparison of the properly quantized scheme with modified quantized and MMSE quantization schemes. The Tx-relay and relay-Rx 
link SNRs are fixed at Pi = Pa = 2dB. 

scheme" is very close to the "properly quantized scheme". 

VI. Conclusion 

In this paper, we derived the optimal (unquantized) Tx/Rx beamforming vectors and the optimal relay weighting 
matrix to maximize the total received SNR of MIMO AF relay channel both with and without the direct Tx-Rx 
link. We showed that the Grassmannian codebooks are appropriate choices for the quantization codebooks in the 
quantized scheme. We proposed a modified quantized scheme which performs very close to this quantized scheme 
and requires considerably fewer number of feedback bits. Finally, the analytical results were verified by comparing 
the performance of the unquantized and quantized schemes under different scenarios. 
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-• — Modified quantized scheme, 16 vectors 
-A — IVIMSE quantization 
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Fig. 10. Comparison of the properly quantized scheme with modified quantized and MMSE quantization schemes. The direct link and relay-Rx 
link SNR are fixed at Pq = "4dB and P2 = 2dB. 

Appendix I 

The Distribution of the Optimal Beamforming Vector s* 

In this appendix, we show that there exists a solution s* to the problem (25) that is uniformly distributed on the 
unit sphere in C™, where m is the number of Tx antennas. 
The problem (25) is repeated here: 

= arg max , J"^;'J[ . + /i||Hosf, (I.l) 
||s||=i ||His||^ + A 

Consider Ho = UoSqV^ and Hi = UiSiVf as the SVD of Ho and Hi. Clearly: ||Hos|| = ||SoV^s|| and 
||His|| = llSiVf'^sll, since Uo and Ui are unitary matrices. 

It is easy to check that s* ~ Vo?7(So, Si, Vf^Vo) is a solution to (I.l), where the function 77(-, •, •) is defined 
to be a solution to the following problem: 

77(So,Si,VfVo) 

aef llSiVfVotf , 2 

= are max — tt h u Snt . (1.2) 

l|t|l=i ||SiVfVotf + A 

If we fix So and Si, the solution s*, identified above, can be expressed as a function of Vo and Vi: 

= C.o,.,(Vo,Vi) =^ Vor;(So,Si,VfVo). (1.3) 
Now, for any unitary matrix Q, we have the following from (1.3). 

C.„,., (QVo, QVi) = QC,^^,,^ (Vo, Vi) = Qs^ 
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For a Rayleigh channel matrix Hq, we know the the random matrix Vq is independent of Sq and its distribution 
does not change by pre-multiplication by a unitary matrix Q. The same argument holds for Hi, Vi and Si. 
Therefore, conditioned on Sq and Si, the matrix QVq has the same distribution as Vq, and similarly QVi has 
the same distribution as Vi . Since the Tx-Rx and Tx-relay channels are assumed to be independent, Vq and Vi 
are also independent, and therefore the joint distribution of (Vq, Vi) is also the same as the joint distribution 
of (QVo,QVi). Hence, any arbitrary function of these pairs will have the same distribution. By applying this 
to the function Cso.^i (')' we conclude that s* = Csq.ei (^o, ^i) and Qs* = Ceo.^i (QVq, QVi) have the same 
distribution. Since this it true for any unitary matrix Q, we conclude that s* is uniformly distributed on the complex 
unit sphere, conditioned on Sq and Si. 

Note that if the conditional distribution of s* is uniform, its unconditional distribution is also uniform. Moreover, 
the random vector s* is independent of the random matrices So and Si, since its conditional and unconditional 
distributions are the same. 



Appendix II 
Proof of SNR Loss Upper Bounds 

In this appendix, we prove the SNR loss upper bounds of (22), (30) and (32) in three separate sections. We will 
first prove the following lemmas, which are frequently used in these sections. 
Lemma 1: For normegative variables xi, X2, yi and y2, we have: 

XlVl X2y2 



< \xi - .T2I + \yi - y2\■ 



^+Xl+yl l+X2+y2 

Proof: We use the following inequahty, which can be easily verified by basic computations. For any a > 0, 
b> and c > we have: 



a + c b + c 
Now the expression in Lenmia 1 can be written as: 

xiyi X2y2 



< -\a-b\. 
c 



(II. 1) 



l+X2+y2 

xiyi xiy2 



= Xl 



1+xi+yi l+a;i+y2 
2/1 



xiy2 



X2y2 



2/2 



l+xi+y2 l+X2+y2 

Xl 



yi+{xi+l) y2+{xi+l) 
2/2 



+2/2 



X2 



a;i+(2/2+l) x2+(y2+l) 



I2/1 - 2/2I + 



Xl - X2\ <\xi - X2 \ + \yi - y2\ 



Xl+l J/2+1 

where (a) is the triangle inequality and (b) results from (II. 1). Finally (c) results from j^^<l and ^^^<1, since 
Xl and 2/2 are normegative. ■ 
Lemma 2: For the matrix H e C^^^ with independent CAf{0, 1) entries, we have: E {J^i f^i} = PQ, where aj's 
are the singular values of H. 
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Proof: Let H = [/i^], where h,j ~ CN{Q, 1). We have: 



E|E^'}=E{Trace(HH^)} 



Lemma 3: Consider the codebook C = {wi, W2, • • • , wjv} and the matrix H with cji's as its singular values. 
For any unit vector s define e C as the closest vector in codebook C to s and let d(-.{s) '="' d{s,S(-,), where 
d{-, •) is the distance function defined in (1). Then, we have: 



IIHsf -||Hsjn<2||^a2^rf,(s), 



Proof: For arbitrary unit vectors u, v and w, we have the following from the triangle inequaUty: 

|d(u, v) — (i(v,w)| < d{u,w). 

On the other hand, 

\d{u, v) + d(v, w)| < |d(u, v)| + |d(v, w)| < 2. 
By multiplying the both sides of these inequalities we get: 

I (i^ (u, v) - (v, w) I < 2d(u, w) . 
Considering the definition of the distance function d{-, •) in (1) we have: 

< 2d{u,w). 



(n.2) 



Now, if the right singular vectors of H are denoted by Vj's, we have: 



IIHslP- IIHwlP = 



and by applying (II.2), we get: 



IHsll 



(11.3) 



The proof will be complete after substituting w in (II.3) by s^, . ■ 
Lemma 4: Consider the codebook C(A'^, 5) and the function (•) defined in Lenrnia 3. For the random vector 
s e C" uniformly distributed on the unit sphere we have: 

2(m-l) 



E{ii„(s))<l-W(^- 
Proof: The proof is based on the arguments given in [5]. 
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A. Proof of the Upper Bound in (22) 

The optimal unquantized SNR 7* and the quantized scheme SNR 7 are given in (17) and (21), which are repeated 
here: 

7* = T-^' 7=-^, (II.4) 
1 + 71+72 1+71+72 

where 7*, 72, 71 and 72 are defined in (18) and (20). Clearly 7^" > 71 and 72 > 72. and therefore, 7* > 7. Our 
goal is to bound 7* — 7. For this purpose, we need the following definitions. 

'^ifo wa V. m2 ^,'^±fv MXT „ II 2 ^.f^lf 7i72 



7i = -Pi IIHibc Jr, 72 = ^2||H2gc Jr, 7 = 



l+7(+7r 



where b^-,^ is the closest vector in the codebook Ci to bi, and g^,^ is the closest vector in the codebook C2 to 
gi. Note that, by the notation of Section 111, bi and gi are the strongest right singular vectors of Hi and H2. By 
considering the definitions of 71 and 72 in (20) and the fact that b^^ G Ci and g^^ € C2, it is clear that 71 > 7^ 
and 72 > 72, and therefore, 7 > 7'. Hence, we can write: 

. . 7i*7^ 7l72 

'y — ^ S: ^ — T 



l+7i+7j 1 + 71+72 

(a) 

<(72 - 72) + (7i*-7i), (n.5) 

where for (a) we have used Lemma 1 . The terms on the right side of (II.5) can be bounded as follows. 
Noting the definitions of 7^, 7( we have: 



7i 



-7i=Pi (||Hibif-||Hib,J|2) < 2Pi (^J2^^ ^c,(bi), 



where for (b) we have used Lemma 3, and (pi's are singular values of Hi. The term 72—72 can be similarly 
bounded. Combining these bounds with (II. 5), we get the following upper bound: 

7* - 7 < 2 0?^ rfc, (bi) + 2 ^ i^ij d,^ (gi), (II.6) 

where Vi's are singular values of H2. Noting that the singular vectors bi and gi are uniformly distributed on the 
unit spheres (of the corresponding dimension) and are independent of the singular values, we can apply Lemma 2 
and 4 to (11.6) to achieve the upper bound in (22). 



B. Proof of the Upper Bound in (30) 
Define: 



, .def 7181)7282) , . ^ 

7(S1,S2 = — — 1—— — -— +70S1 , 

1 + 7i(si) +72(82) 



def 



where 7i(s) = Pi||His|f , for i = 0, 1,2. With these definitions, the SNR of the optimal unquantized scheme 7* 
and the SNR of the quantized scheme 7 can be expressed as: 



7*=|mp^7(si,S2) =7(s^gl) = T^^T^ + 7o* 

l|S2|| = l 

7 = max 7(wi,W2) =7(s,g), (11.7) 

wi GCi 

W2eC2 
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where gi is the strongest right singular vector of H2, and s*, s and g are defined in (25), (29) and (28). Also 
lo = 7o(s*), 7? = 71 (s*), and 7^ = 72(gi)- 

Our goal is to bound the SNR loss 7* — 7. For this purpose, we need the following definitions. 

7'^=^>-^+7^, 
1+71+72 

7o = 7o(ScJ, 7i = 71 (Sc,), 72 = 72(gcJ, 

where s* ^ G Ci is the closest vector in the codebook Ci to s*, and g^^ G C2 is the closest vector in the codebook 
C2 to gi. 

Noting the above definitions, it is clear that 7* > 7 > 7' and we can write: 

7(72 



7*— 7 < 7*— 7' < 



l+7i*+72 l+7i+72 



+ l7o-7^l 



<l72"72l + W-7;i + l7o*-7ol 

<:^^P2|||H2gl||2-||H2g,jn+Pl 

+ Po|||Hos*||2-||Hos* f 



< 2P2 (E^') rfc.(gi) +2Pi {12^^ ^c,(s'^) 

+ 2^0 (E^^) (U.8) 

where we have used Lemma 1 for (a). In (b), {7*}|=o {7i}i=o ^^^^ been replaced by their definitions. Finally, 
(c) results from Lenama 3. 

We know from Appendix I, that s* is uniformly distributed on the unit sphere and is independent of the eigenvalues 
Vi's and (pi's. The same argument holds for the singular vector gi and the singular values tpi's. Considering this, 
we can take expectation from both sides of (II.8) and use Lemma 1 and Lemma 4 to achieve the upper bound in 
(30). 

C. Proof of the Upper Bound in (32) 
As in Appendix II.B, the SNR of the optimal unquantized is given by: 

||Sl|| = l 
l|S2|| = l 

where s*, gi and the function 7(-, •) are defined in Appendix ILB. As described in Section IV.B.2, the quantized 
beamforming vectors are determined from: 

g = arg max 71 (w), s* = arg max x(w, g), (II.9) 

WGC2 wGCi 

where 

/ ^def 71(81)72(82) ,d\^2\-H |2 ^TTim 

x(si,S2) = , . . \. / , +Po>,^'» Si . (ILIO) 
1 +71 (si) +72(82) ^ ' ' 
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In (11.10), Vi's are the singular values of Hq and e^'s are the quantized version of e^'s which are the right singular 
vectors of Hq. The SNR value resulted from the choices in (II. 9) is: 

7*=7r,g)- (11.11) 
Our goal is to bound 7* — 7*. For this purpose, we need the following definitions from Appendix II.B: 

s = arg max 7(w, g) 

wGCi 

7 = 7(s,g). (11.12) 
The SNR loss 7* — 7* can be expressed as: 

7* -7* = (7* -7) + (7 -7*)- (11.13) 
The first term has already been bounded in Appendix II.B. To bound the second term we will need the result proven 
in Lemma 5 (at the end of this section). Let 9 = 2Pq ^I'^co (^«)' '■1'^'^ have: 

(a) (b) 

7 = 7(s, g) < x(s, g) + 6* < x(s*, g) + 

(c) 

< 7(s*, g) + 2^ = 7* + 26, (11.14) 

where in (a) and (c) we have used Lemma 5, and (b) results from (11.9) and the fact that s e Ci. By combining 
(n.14), (n.13) and (11.8) we get the following upper bound: 



+2 (^A(^^0?j+Po(E-'))^c,(O 



+4Po5^!^fdco(ei). (11.15) 

i 

From Appendix 1, s* is uniformly distributed on the unite sphere and is independent of the singular values (piS 
and fj's. The same argument holds for the singular vectors gi and e^'s and the corresponding singular values ipiS 
and ViS,. By considering these facts and taking the expectation of both sides of (11.15) and using Lemma 1 and 4, 
we get the upper bound in (32). 

Lemma 5: For any unit vector s, we have: 

l7(s, g) - X(s, g)| < 2Po "i'^oo («^)- 

i 

Proof: Noting the definition of 7(-, •) in Appendix II.B, 

1 + 71 (s) + 72(g) V 



Therefore, 



l7(s,g) -x(s,g)|=Po 



(a) 



j:-!{Ks\'-\efs\') 
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where in (a), we have used (II. 1) in Lemma 3. Noting that Cj's are by definition the closest vectors in Co to e^'s, 
we have d(ej, Cj) = d^^ (ej) and the proof is complete. ■ 

Appendix III 

Comparison of the Optimal and Modified Unquantized Schemes 

In this appendix the following lemma wiU be used to bound the SNR loss of the modified unquantized scheme 
with respect to the optimal unquantized scheme. 

Lemma 6: Consider the SVD H = USV^ for an arbitrary matrix H e C'^", where U e ZY', V = [vi | • • • |v„] e 
ZY", and S = diagixn(o'i) '^2, • • • , tTr), where r = min{/, n}. Then for any unit vector s we have: 

a?|vfsf < ||Hs||2<a2|vfs|'+a2. 
Proof: Note that ||Hs|p = '^f I'^i^s^- The left side inequaUty in Lemma 6 is obvious, since a^ |v|^s|^ > 
for i > 1. The right side inequahty can be proven as follows: 

|2 . \ ^ 9 1 H" |2 



||Hs||2=a?|vfs|%$:a,^|vfs| 



(a) 



<af|vfs|^+ai5]|vfs| 



i>l 

<al |vf s|' +4Y^ K^\^ < ^1 + 
where (a) results from ct2 > Ci for i > 1. In (b) we are adding the nonnegative term ctI | v^s|^ and (c) results from 

n 

|vf s|' = \\V"sf = s"VV"s = s"s = \\sf = 1, 

since V is a (square) unitary matrix. ■ 
Considering the definition of the function 7(-, •) in Appendix n.B, the SNR of the optimal unquantized is given 
by: 

^* = „°^F S2) = 7(s*, gi), 

||Sl||=l 
l|S2|| = l 

where gi is the strongest right singular vector of H2 and s* = argmax||s||=i 7(3, gi). 

On the other hand the Tx beamforming vector of the modified unquantized scheme is determined by: 

s„oa./iea = arg max ^(s, gi), (III.l) 

where 

= \ ''^^ 71(81)72(82) p 2 |2 

^(si,S2) = -— 1—^ ^ + Pqi^i \ei si\ . 

1 + 71 (Si) +72(82) ' ' 

Here ui and ei are the largest singular value and strongest right singular vector of Hq, respectively. The corre- 
sponding SNR of the modified scheme is: 
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Noting the definitions of 7(-, •) and ^{■, •) and using Lemma 6, we have the following for any unit vector s: 

7(s,gi)<^(s,gi) + Poi^|, (111.2) 

where U2 is the second largest singular value of Hq. Taking the maximum of the both sides of (111.2) over the unit 
sphere, we get: 

7*=7(s*,gi) < e(s_,.,..„gi) +Po^^| 

<7(s„<.,,,,,„ gi) + Po4 = l^o.i,ie. + Po'^l (IH-3) 

where (a) results from the fact that ^(si,S2) is globally upper bounded by 7(51,82) for any si and 82 (Note the 
first inequaUty in Lemma 6 and the definitions of 7(-, •) and ^(•, •)). Taking expectation of both sides of (111.3), we 
get: 

E{7*} - E{7^„,,,,^J < PoE{ul}. (IIL4) 



On the other hand, 



7 = max 7(81,82) > 7(ei,gi) > PqJ'i, 

||si||=l 

l|S2||=l 



and therefore, £{7*} > PoE{j^i}- Combining this with (in.4), we get the following upper bound. 



or 



E{7*}dB - E{7_,,,,eJdB < lOlogio (1 + 



For Rayleigh channel matrix Hq e C^'^, this upper bound is equal to 1.24dB. 
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